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PRELIMINARY REPORT ON AFSAM 9' DEPTH STUDY 
INTRODUCTION ! 

One of the greatest difficulties encountered in assessing the security 
of cryptographic devices which operate on principles similar to that of the 
AF3AM 9 is the determination of the probability of depth. It has been 
possible to derive satisfactory a 1 priori estimates of the probability of 
depth for the class of structures wherein each setting has a unique pred- 
ecessor; e.g. the AFSAM 7* However, a 1 priori estimates of the probability 
of depth, are not at all adequate for structures, such as those generated 
by the AFSAM 9 % which are comprised not only of single points (i.e. settings 
which have a unique predecessor), but also of critical points (e.g. branch 
points and origins) • 

Thus the only practical means of ascertaining the incidence of depth 
for structures containing critical points is by means of sampling. As to 
what specific sampling procedures should be employed, there are obviously 
very many theoretically sound procedures one could propose ; however, whether 
or not a particular sampling procedure is feasible will be largely determined 
by the kind pf high speed analytic equipment that is available. 

In Fart I of the report which follows is given a description and 
analysis of the sampling procedures that have been devised to determine the 
incidence of depth, for AFSAM 9 traffic together with a description of the 
various types of statistics that can be secured by utilising these pro- 
cedures. In Fart II of the report* the identification and evaluation of 
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those notch pattern parameters, on which the incidence of depth depends, 
are discussed. 

Part 1 

The sampling procedure which is currently being used in the study of 
the APSAM 9 structure utilises the facilitiee afforded by PLUTO* Another 
sampling procedure which will make use of ATLAS II is currently being 
programmed and will be ready for uee in the tear future* The first half 
of Part 1 will describe the PLUTO program* This will be followed by a 
description of the ATLAS II program* 

The sampling procedure on PLUTO is logically (but not chronologically) 
equivalent to the following steps t 

(1) A quasi-random initial setting is first generated (by a method 
to be described later)* 

(2) With the wheels set to this Initial setting, the AF9AM 9 is then 
stepped through a message of length 5000, and the 5000 th setting is then 
stored in a '•memory." 

(3) The wheels are new set to a new quasi-random initial setting, 
and the device is then stepped through a second meeeage. Bach subsequent 
setting that is generated is compared with the setting in the "memory." 

The results of this comparison can be characterised by the following 
figures, where M. is the Jth setting on the 1th message i 
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*1,1 M 2,l **L,1 * 2,1 



\ 



5000 



*S .,5000 



Hj k (k > 5000) 



2,10000 



2,10000 



Case I Case II Case III a,b, 

If there is a hit before the device has stepped more than 5000 tines, 
as in Case I, the first message is in depth with the second for at least one 
setting* If there is no hit before the second message has been stepped 
through 10,000 settings, as shown by Case II, the two messages are not in 
depth. 

(4) If there is a hit before the second mssdage has been stepped 
through less than 10,000 settings, but more than 5»000 settings, as indicated 
by Case III, the 5000th setting of the second message is then inserted in the 
"memory, " and each setting of the first message is noV compared with the 5000 th 
setting of the second message. If there is a hit before the first message has been 
stepped through 5000 settings, the two messages are then known to be in depth for 
at least one setting. 
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Case III a Case III b 

Depth No Depth 

(5) If the first and second Messages are found not to be In depth, «* 

new quasi-random initial setting is generated and this }hird message is then eon- 
pared for depth with each of the first two messages in the manner outlined 
in steps (1) through (A) . 

(6) Sampling according to the steps prescribed in (1) through (5) is 
continued until same message j is found to be in depth with some message i, 
where j > i. The "statistic J" thus obtained corresponds to the number of 
messages that had to be sent in order to insure that the jth message was 
the first one yielding depth with any prior message sent in the same crypto- 
period. If an adequate number of runs are made, say n, yielding the dis- 
tribution of ths J's, this statistic can then be utilised as a reliable 
criterion to determine the amount of traffic load that the AFSAM 9 can 
safely carry in any given cryptoperiod. 
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In a similar fashion other types of statistics can also be obtained* 
such as a statistic corresponding to the number of messages which have to be 
sent in order to insure that exactly k pairs ot messages be in depth for at 
least r settings; or a statistic corresponding to the number of messages which 
have to be sent in order that k n-tuples be in depth for at least r settings 1 * 
The above sampling procedure was not feasible until Just recently* as 
the time involved in obtaining a single statistic of the type J required 
about 120 hours* The samplihg process has now been almost completely 

l 

automatized by devising a program which enables PLUTO to generate its own 
"quasi-random" initial settings. At the present time PLUTO can obtain a 
j statistic in about 45 minutes. 

Concerning the sampling procedure currently being used on PLUTO* the 
feature which is most likely to incur adverse criticism is the method by which 
initial settings are generated* The initial settings of messages that are 
to be used in obtaining an observed value of J are generated in the follow- 
ing manner. In addition to the actual AF5AM 9 notch pattern^ separate 
randomly constructed notch patterns are inserted on the wheels W-2, W-3, W-5, 
W-6, W-7 and will be referred to as the secondary patterns. The wheels are 
now initially set to some randomly selected setting (designated here as the 
primary setting) and stepped through message 1. After the final setting of 
the message has been generated* the rules of motion of the AFSAM 9 are amend- 
ed by the addition of the following rules: 

1. W-l, W-2* W-5 and W-6 are delayed by the simultaneous occurrence 
of a no-notch on the secondary pattern of W-5 and a no-notch on an auxiliary 



5 




uaUHtiT 



two point wheel which steps every time* 

2 . W-i Is stepped by the occurrence of a notch on the secondary 
pattern of wheel ¥ - ( 1 -lX# where 1 * 3 , k, 7 * 8 , 

The wheels are now stepped according to the amended rules of motion 
for some fixed Interval, say 10,000 steps. The 10,000 th setting arrived 
at under the amended rules of motion is now defined as the initial setting 
for message .2. By making this procedure iterative, a set of initial settings 
can thus be generated. 

It is apparent that the above procedure will ultimately result in the 
generation of a cycle, so that after a certain point in the iteration the 
process will become periodic. There is therefore a limit to the number of 
initial starts that can be generated by the above procedure. Since it is 
easy to ascertain at Just what point the process becomes periodic, this limit 
is determined and is never exceeded. 

Inasmuch as the AFSAM 9 notch patterns are held constant throughout 
any given run, it is the secondary patterns and the primary setting whioh 
determine what initial settings are used in securing h statistic. It is 
therefore important that a new random primary Betting and a new random set 
of secondary patterns be selected for each statistic that is to be obtained. 

In regard to the selection of the amended rules of motion referred to 
in the aforementioned procedure for generating initial settings, the selection 
was not completely arbitary but was governed by the following factors t 

(a) It is desirable that the amended rules of motion be easy to 
implement on PLUTO* 
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(b) In comparison with the actual AFSAM 9 structure,, the 
structure generated by the amended rules of motion should not be comprised 
of a disproportionately large number of confluences. 

(c) It has been shown in £l} that the cycle structure of the 
AFSAM 9 is comprised of 108 disjoint sets of structures, where the selection 
of the initial setting of W-l, W-2, W-5 and W-6 determines which of the 
structures is being used. It was desired that depth studies be confined to 
a particular one of the 108 mutually exclusive sets into which the structure 
of the AFSAM 9 can be resolved, rather than to study the entire structure. 
Therefore, a set of amended rules of motion in order to be satisfactory must 
maintain the relationship of the settings of W-l, W-2, W-5 and W-6 necessary 
to this chosen one of the structures. 

Inasmuch as it is desirable that any method of generation of initial 
settings simulate a random process, one would like to know if it is 
statistically feasible to assume that the initial settings generated by 
the procedure described above could arise by random sampling from the 
population at large. To determine whether or not the hypothesis that the 
generated initial settings could have been obtained by random selection can 
be statistically rejected, one could devise any number of tests to exhibit the 
degree to which the method of generation of initial settings is characterized 
by mathematical randomness. However, it should be pointed out that even 
though one were able to reject statistically the hypothesis that the generated 
initial settings are mathematically random, the method of generation might 
still be acceptable. That is, a set of initial settings may fail to exhibit 
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characteristics of mathematical randomness and yet be dispersed so 
haphazardly over the, entire AFSAM 9 structure that their use does not 
invalidate any statistical analyses pertaining to depth studies. All that 
is required here as regards the conditions of selection of initial settings 
is that there should be no marked element of preference or bias that would 
tend toward the inclusion or exclusion of certain local areas of the AFSAM 9 
structure. 

Suppose, then, one were to compare several prominent characteristics 
(such as precentage of frequency of occurrence of the structures, frequency of 
incidence of certain entry points, expected length of lead-in for each of 
several entry points) of the statistical picture of a particular AFSAM 9 
structure obtained from N mathematically random initial starts with these 
same characteristics of the statistical picture obtained from N initial 
settings that have been generated by the method in question. If it were 
found that in both oases these characteristics were substantially the same, 
it would then eeem quite reasonable to regard the method of "pseudo-random 11 
generation as satisfactory for the pdrpoee of depth study. 

A comparison teat of this type, based on a sample of 1000 initial starts, 
has been applied to one of the 108 sets of structures generated by the AFSAM 9 
for an arbitrarily chosen eet of notch patterns. The results of this test are 
described in Appendix A. 

The ATLAS II procedure involves a more direct approach to the solution 
of the depth problem, simply by answering the question, "How many messages 
can be sent which will be in depth with a given messaged" This number can 
be found by reproducing that portion of the cycle structure which contains 
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all paths leading into the given message and for which no path exoeeds the 
maximum message length } say 5000. In order to simplify the problem and thus 
decrease machine time, the assumption has been made that the probability of 
depth with an indicator system (restricting the number of possible message 
starting points) is not significantly greater than the probability of depth 
with no restrictions. At this writing there has been no conclusive evidence 
denying this assumption. 

A more detailed description of the ATLAS program follows: In a manner 
analogous to a cycle study, whereby a picture of a particular cycle structure 
is desired, consider the message as a "cycle" of length 5000. The entry points 
to this "cycle" are then all the branch points on the message together with the 
initial or starting point of the message. Then the lead-irmwill be the paths 
taken by other messages such that depth occurs. 

The method of obtaining this sub-structure is a two-phase procedure. 

The first phase involves normal (forward) stepping of the device from some 
setting to produce the message as far as setting M^qqq* The second phase 
uses reverse (backward) stepping starting from the branch points and arK * 
continuing 5000 points or until an origin terminates the stream. This can be 
shown graphically in the following diagram, where 0 represents an origin, B 
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In order to simplify a description of the ATIA9 procedure, consider 
the following diagram, where an arc centered at £he message branch point is 
drawn through each successive point in the structure (backward) such that a 
point on arc A^ is i settings removed from the message branch point* 




Two or more points on the same arc have in comnon the 3-wheel COM setting 
contained in the AFSAM 9 setting. Furthermore, the motion of the remaining 
five wheels proceeding from a branch point (necceBsarily of multiplicity two) 
in one direction is the complementary motion of the opposite direction. These 
two statements constitute the basis of the counting procedure, whereby successive 
CCH settings account for the 1 associated with along one stream (direction), 
and the number of such streams f^ intersecting or terminating on A^ is 

f i * f i-l + B i-1 * °i-l 

where ^ and 0^ are the number of branch points and origins respectively 
along Aj_]_« Hence the totality of points in this sub-structure is 
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If one calls the stream originating (backwards) from the message 
"machine number 1", or mp then In determining the two predecessors of a 
branch point, let the predecessor that is reached by moving Wj (one of the 
5 wheels not contained in the three CCM wheels) originate a new machine, 
while the other predecessor will continue along m^. Continue this procedure 
for each branch point. Then at some arc Aj there will be f^ machines still 
active in the backward generation and counting procedure. Therefore the 
procedure is first to move the CCM wheels; secondly (for each m^) to determine 
whether the point is a single point, branch point or origin (simultaneously 
finding the motion pattern) i finally to move the wheels if necessary. If 
is at a single point, no "machine" change is necessary. A branch point 

involves setting up a new "machine", while an origin causes that "machine" 

* 

to be made available for a new branch when needed. 

The preceding paragraphs show how T, the total number of points from 
which a depth of at least one can occur, is found. In general onw is looking 
for Tp, the number of points yielding a depth of at least D > 1 : 



5000-D 




Then the total sub-structure will give ■ E Tp + $000>(D-1) points which 

will result in a depth of D, where the summation is over 

the entry points to the message, say H^, where 1 < 5000-(D-l). Other Counts 

i 

to be tabulated are the number of branch points and origins in the sub-structure. 

Let the total number of points in the structure be designated by N. With 
each of several sets of notch patterns, a sufficient number of runs will gire 
enough statistical data to yield p ■ Np/fj* which can be used to estimate the 
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true probability of depth » 

Part II 

In any empirical approach to the problem of determining the incidence 
of depth, the question quite naturally arises as to whether one should con- 
fine his study to several setB of notch patterns that have been deliberately 
selected so as to amplify or exaggerate certain attributes of the notch 
pattern population at large; or whether one should employ the msthod of random 
, selection. If enough information concerning the relation between the notch 

s 

pattern parameters and the genealogy of the AFSAM 9 structure is available, 

the former procedure would seem to be the more expedient. It therefore is 

' desirable to devote some effort to the problem of identifying and evaluating 

those notch pattern parameters of which the incidence of depth is a function. 

The most important attribute affecting the incidence of depth is the 

number of branch points that are contained in the motion structure generated 

by a set of notch patterns. Let N (B> be the function which relates the number 

of branch points to the notch pattern parameters. The CCM settings distribute 

themselves into three classes Of else (36) /3» since (21,15) ■ 3* One of the 

criteria for a branch point is that W-3 be on a minus (-) . Hence the number 

2 

of settings of the CClf which nay yield branch points is (36) 15/3 ■ 6480. 



The 16 branch point patterns oh 5 wheels give 
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b 4 B ? Bg£iB 5 (15*« 6 ) ♦ *B 6 (15*C^y 
where is the number of no-notches proceeded by no-notches (— ) on W-l. Hence 
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N (B) - 6480 By B g [Uj (15^) ♦ 4B g (15 +Cj)], 

- 25,920 B U By Bg [B 5 (30-B 6 ) +‘B 6 (30-B 5 )] 

- 51,840 B fc By B g [15 (B 5 *B 6 ) - B 5 B 6 ] 

In the above expression B^ (i ■ 4, 5,6,7,®) corresponds to the number of 
blocks of which the notch pattern on wheel Is comprised. The number 
of blocks is defined as one half the number of changes of sign. Because 
of the criteria that are used in defining whether or not a given notch pattern 
is an admissable one, N (B) is independent of the patterns on W-l, W-2 and 
W-3, and the B^ are limited to the range of values 5 < B^ £ 15 • It is 
apparent that the function N (B) is maximized when: 

(1) Bg - ft, - Bg - 15 

(ft) 8 - 15 and/or Bg - 15 

and is minimised when: 

(1) B ± - 5 (i - 4,5,6,7,8) 

Therefore, the number of branch points in any AFSAM 9 structure cannot 
exceed 118, 098, 000, 000 j nor can the number of branch point b be lees than 
2,430,000,000. If the maximum number of branch points that is realizable 
is defined as 100 percent saturation, it then follows that the totality of 
branch points can never be less than 2.05 percent of saturation. 

Statistics relevant to the incidence of depth which have been secured 
for a particular set of notch patterns would be much more significant if 
the probability with respect to a given notch parameter of selecting the 
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set is known. For example, if a set of patterns is selected which generates a 
structure with the number of branch points classified as IB-L9 percent 
saturation, it would be very useful to be ablo to estimate the probability of 
securing by random selection a set of patterns for which the number of branch 
points would have a classification which exceeds k9 percent saturation* 

Because the degree of branch point saturation is the most important 
attribute affecting the incidence of depth, it is expedient to devote some 
time to the problem of deriving a probability distribution which can be used 
to determine the probability of selecting a set of patterns for which the 
degree of saturation is greater than x percent. Inasmuch as the number of 
branch points is, in terms of the notch pattern parameters, a function 
only of the number of blocks, the problem of obtaining such a distribution 
obviously entails a means of counting or estimating the number of admiseable 
notch patterns which have i blocks (5 ^ i 15) • The problem of securing 
such counts is not exactly trivial, but could be programmed for ATLAS II. 
However, the probability distribution for the population of patterns which 
satisfy only that portion of the criterion of admiseability which requires 
that there be no mors than five contiguous notches or no-notches can be 
obtained directly from known results obtained in a report by B. Harris, 
NSA-3U, "On the Number of Cyclic Binary Patterns”. This distribution 
appears on page number 15, 
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DISTRIBUTION A 



Theoretical 



1 


P1U 


Rearouoed P(i) 
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48. 


8 
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10 


0.294 


.294 


370. 
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217. 
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0.058 


.058 


72. 


13 


0.011* 




14 


0.001 


.012 


15* 
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The working hypothesis was now made that the above distribution 
for the population of patterns which are only partially admissible 
does not essentially differ from the distribution for the population of 
adndseable patterns and could therefore be used to estimate the pro- 

portion of admissable patterns which have 1 blocks. This estimate 
was then incoproreted in to an ATLAS program to obtain the desired 
probability distribution, which is given in Appendix B. 

Empirical results which lend eredehce to the working hypothesis re- 
ferred to in the preceding paragraph were secured in the following manner. 
The distribution of the number of admissable patterns which have 1 blocks 
was obtained for a previously constructed and available sample of 1258 
fully adndssable patterns. (For a description of the method by which 
these admissable patterns were constructed, see reference QQat the end of. 
this report) • This sample distribution together with the computation of 

is given on page 16. 
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DISTRIBUTION B 



, 2 
(fo - $ ft 

.021 

.049 
.045 
.011 
2.250 
1.681 

-..-*643 

Lm 

2 

From a table of ^ it la seen that when the number of degrees of-frSedom 
2 

is 6 a value of X as great or greater than 4.779 would occur purely as a 
result of chance In more than two out of three random samples. Therefore 
the working hypothesis that distribution A is not essentially different from 
distribution B is entirely consistent with the above results. 

An examination of the distribution in Appendix 8 shows that the 
probability of securing by random selection a set of adndssable patterns having 
a percentage of saturation greater than 50 percent is less than 0 . 0005 . 1 f. 
therefore, it were known that the only important attribute affecting the 
incidence of depth is the number of branch points comprising a structure, 
statistics relevant to the incidence of depth could then be secured by the 
procedures described in Part I for several sets of patterns having, say. 
from 48 percent to 50 percent saturation. These statistics could then bs 
used as criteria to determine the amount of traffic load that can be safely 
tolerated, since the probability of a situation arising for which these 
saturation criteria are not satisfied would be less than 0 . 0005 * 
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The identification and evaluation of other notch pattern parametere 
which affect the incidence of depth are therefore matters of practical 
importance. Before attempting to identify any other pattern-parameters which 
might affect the incidence of depth, however, it is first necessary to 
determine how the genealogical characteristics of a general structure of the 
AFSAM 9 type are related to the incidence of depth. Those characteristics 
of the structure which do significantly affect the incidence of depth may 
perhaps then be expressed as some function of the pattern parameters, as has 
already been done for the function N (B). 

An investigation of the characteristics of a general structure revsAls 
that there are three distinct types or classes of branch points. Those 
branch points which have as their two predecessors (either imnediate or 
removed by one or more single points) two branch points are here classified 
as type I. Those branch points which have as their two predecessors two 
origins are classified as type II, and those which have as their two 
predecessors one origin and one branch point as type III. 

In terms of these three classes of branch points, the two limiting 
distributions for any tree comprised of N critical points are: 

Distribution I Distribution II 

Type I N/2 1 

Type II N/2 ' 1 

Type III 0 N-2 
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(The configuration comprised of an n -cycle branch point together with its 
entire associated off -cycle network of predecessors is here referred to ad 
a "tree".) 

From the illustration* of two trees which are given below, it id 
apparent that for structures comprised solely of critical points the 
function F (P), corresponding to the probability of a pair of messages 
being in depth, is maximised for the set of trees having the limiting 
distribution I* 

Distribution 1: N *32 Distribution II: N * 32 




TfPE II TREE 
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The function V (Pj) has been formulated for 'the particular Distribution 
I case for which N = 2* : 



F (Pi> - f 2 |[3 (ar -IJK. (2-11.2 



L-2 



L-l _L-1 



In the above expression L is the length of the message and M is the weight 
of the tree, i»eo the total number of points or settings of which the tree 
is comprised «, The function ? (Pj) gives the exact probability of depth for 

C 

messages of length L when N is a power of two, and is a reasonably good 
approximation for all other admissable values of No (Trees having the 



limiting distribution I require that N~l0 (mod &) 0 while those having tho 
limiting distribution II require that NsO (mod 2) ) 0 

The corresponding function F (Pjj) has been formulated for all 

i 

admissable values of N, and appears below: 




It is apparent that when L ° 2, 

f (pj) - f (p u ) - -^r[ N “ l J p 



and that when 3, F (P^) > F (Pjj) , Moreover, when L is approximately 
50 or more and W L, the ratio 



F (Pj) 

fTSJJ 



1 

2 




L-2 % . L-l L-l 

-1) W - (2 -1) 2 

L (N+2) - L 2 - (N+l) 
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lt would thus appear that the type of branch points of which a structure) 
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is predominantly comprised is a factor which does significantly affect the 
incidence of depth* 

It has been conjectured that the distribution of single points be- 

} 

tween branch-origin segments and branch-branch Segments may materially affect 
the probability of depth* In accordance with tills conjecture, it is believed 
that the most favorable distribution consists of distributing single points 
as uniformly as possible between' branch-origin segments of the tree* Therefore, 
one may conditionally regard the distribution of single points as a factor 
affecting the incidence of depth* 

As regards this third factor, it can be demonstrated that the 
distribution of single points between branch-branch segments of the structure 
cannot be nade to differ .substantially from the distribution of single points 
between branch-origin segment a j so that it is impossible to distribute the 
single points in the ideal manner described above. TO illustrate this fact, 
consider the functions (C^,Wj,X^), F 0B (C^,Wj,X k ), G BBB anc * 

Gqbo ^ C i ,Vr j ,X k ,Y l)» where P B0 determines the number of branch points which have 

' exactly one immediate predecessor that is an origin, F__ determines the number 

Bd 

of branch points which have exactly one immediate predecessor that is a branch 

point, G determines the number of branch points whose two immediate 

predecessors are branch points, and determines the number of branch points 

whose two immediate predecessors are origins* it Can be proven that the 

function corresponding to the number of branch points whose two inmediate 

predecessors Are a branch point and an origin is always null for AFSAM 9 

structures and therefore need not be considered* The functions G„_, 

Wf OBO 
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P i and G B are expressed below In terms of the notch parameters 

BB BBS 

C^« VI y and Yp where corresponds to the number of blocks on Wp Wj 

corresponds to the number of 010 configurations on Wj, 

number of 10i configurations on W. , aAd Y, corresponds to the number of 

\ 1 

000 configurations on In all of the following expressions Hqq i» the 
function corresponding to the number of 00 configurations in the dilated 
output stream of tf-3 . 



X fc corresponds to the 



f B 0 “ »00 (V 15 ^ 5 > w 6 *7 °8 * h °5 ( 15 -° 6 ) °7 °« ♦ c fc <«-C 5 > w 6 *7 *« 

♦ 0 4 o 5 (15 -c 6 ) 0 7 x g ♦ C k C 5 (15-C 6 ) X ? C 8 ♦ X t c 5 ( 15 -c 6 ) w 7 w g 

♦ c t (15-Oj) Xg o ? c g ♦ x 4 ( 15 -c 5 ) x 6 c 7 w g ♦ C t (15 -c 5 ) c 6 x 7 0 g 

♦ 0 t x 5 ( 15 -o 6 ) <7 c g ♦ w 4 (X5 -Cj) 0 6 w 7 W g . w 4 x 5 ( 15 - 06 ) Of » g 

♦ Wg W ( 15 -c t ) *7 Wg * w 5 (15_C 6 ) m 7 *8 * W 4 (15_C 5 ) ° 6 °7 °8 

♦ 0 4 <«-« 5 > Cg C 7 X g ) - 0ogo 

G 0 S 0 ■ Woo ( Z U *5 *6 *7 *8 * *6 w 5 *6 *7 *8 * W 4 *5 *4 *7 *8 

« W t w 5 r g X 7 Xg * * 5 (15-c 6 ) Xy Wg ♦ X t i, (15 -c 6 ) w 7 w g 

‘ Wg ( 15 - 0 j) Xg "7 Wg * X t (15_C 5 ) Xg x, w g ) 



21 




SE C RET 

tgg * ««, ix t e s (15-C S ) x 7 c 8 . c 4 t f (15-c 6 ) 0 7 x g ♦ « 4 (ivc,) cj x ? 0g 

* C t (15-O5) Jtg 0 7 x s * x 4 fl5-Cj) xg 0 7 C 8 . c 4 <15 -c 5 ) o 6 w ? Xg 

♦ w 4 x ? (X5 -c 6 ) C 7 c 8 * c 4 c, (15-cg) k 7 x„ ♦ Xg Oj (15 -o 6 > C 7 w g 

* °i, "5 <«-C6> *7 Og * w 4 <15 -c 5 ) C 6 C 7 *g * 0 k (15-O5) W 6 x 7 Cg 

♦ c 4 (15 -c 5 ) c 6 c 7 Cg ♦ x 4 (15-c 5 ) *g W 7 *g . C 4 c 5 (15-Cg) c 7 c 8 

W 4 W 5 <15-c 6 ) H 7 ¥g) - Oggg 



°BBB • «00 "5 T 6 *7 V h Y 5 *6 x 7 *8 * *4 *5 H 6 *7 *8 * \ M 5 *6 "f *8 

♦ x 4 x 5 <15 -o 6 ) x 7 Xg * ¥ t (15-C 5 ) x 6 i 7 Xg . I 4 (15 -c 5 ) Xg W 7 Xg 

♦ w 4 Xj <15-Cg) » 7 Xg> 

The function <^Hqq has not been explicitly formulated in terms of the 
notch pattern parameters o£ W-l, W-2 and W-3 ; however,, empirical study 
strongly suggests tfcat is relatively independent of the patterns on 

l 

W-l and W-2 and is primarily a function of the number of OO configurations 
in the pattern on W-3« In any case, for any given set of patterns it is 
easy to determine by actual count the exact value of Hqq. 

A cursory examination of the functions F^, Fgg, ®BEB* 61111 ®OB0 rovea1 ' 8 ' 
that it is impossibls to design a set of patterns for which Fqq differs 
appreciably from F BQ and for which G^g differs from Therefore the 

ideal distribution of single points referred to above, vis, that no single 
points occur between branch-branch segments of the structure, is physically 
unattainable, and cannot even be approximated* 
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However, although the distribution of p ingle points between branch- 
branch segments cannot be made to differ radically from the distribution 
of single poiilts between branch-origin segments, it is possible by 
maximizing or minimizing G^^ or G^ to cause the number of branch points 
having as their two immediate predecessors two branch points or two orgins 

i 

to vary over a considerable range. It has already been demonstrated that, 
for N (B) held constant, the incidence of depth is very much greater for 
structures characterized by a type I limiting distribution than for those 
characterized by & type H limiting distribution. Therefore, it would seem 
quite logical to hypothesize that, with N (B) held constant, the incidence 
of depth for a general structure (one that is comprised of a much greater 
number of single points than of critical points) can be significantly 
increased or decreased by increasing or decreasing the number of branch 
points having as their two immediate predecessors two branch points. 

If the aforementioned hypothesis is valid, it then follows that the 

function G 0BB is an important parameter affecting the incidence of depth. 

To determine the statistical feasibility of the above hypothesis the following 

test was made: Two sets of patterns were constructed so that the structure 

generated by Pattern I was comprised of the same number of critical points, 

viz., 48 percent - 49 percent saturation, as that generated by Pattern II. 

Moreover, Pattern I was designed so as to make the function Q relatively 

Bmall, while Pattern II was designed to make G relatively large. Pattern I 

BBB 

and Pattern II are listed in Appendix C. Reference to the distribution in 
Appendix B Indicates that the probability of selecting at random a set of 
patterns which generates a structure in which the critical points comprise 
more than 49 percent saturation is less than 0.0002. Therefore, the 
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parameter N (B) ie very much larger for Patterns 1 and II than it would 
ordinarily be for an operational AFjAM 9 device* The actual value of N (B) 
for Pattern I was 57, ‘261, 313,280 branch points and for Pattern II was 
56, 862,000,000 branch points« 

It will be recalled at this point that the 6ycle structure of the AFSAM 9 
can be resolved into three pair-wise disjoint sets of structures (i»l,2,3) 
and that each set can in turn be resolved into 36 pair-wise disjoint subsets 
C 4 , (i-1,2,3, JO, 1,2, *... 35 ). It is easy to deduce that the number of 

J » 

branch points is the same for each one of the 10B sets of structures 

However, the function 0 will in general differ for each C. , The function 

BBB If J 

Hqq will, of course, take on three values, i.e. a value for each of the three 

sets of structures C. where C, ■ P C 4 . (i“l,2,3). For both Pattern I and 

1 1 i - 0 

Pattern II, it was decided to confine observations to the class of structures 
C l,0* For pattern I » Hoo< c i) - 2625, G B0B (C x ) • 90,578,250 (0.67 percent of 
all branch points in C^) and G (C^q) ■ 2,529,630 0.68 percent of all branch , 
points in C ljQ ) For Pattern II, Hqq (C x ) - 5195, 0 BpB (C^ - 3,991,7 54,880 
(21.06 percent of all branch points in C^) .md O 0BB (C^ Q - 110,911,080 
(21.07 percent of all branch points in C 1>0 *) 

The distribution of the J - statistic for a sample of size 30 was next 
secured for the two sets of patterns, I and II, where in both cases observations 
were limited to the set of structures (The J - statistic has already 

been defined in Part I as the number of messages that had to be sent in order 
to insure that the Jth massage was the first one yielding depth with any 
prior message.) These two distributions are given in Appendix D 0 
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The means for the above two sample dlstnoutlon were now compared to 
determine whether both distributions could have beta drawn from the same 
parent-population* Assuming that the two underlying distributions are normal 
with identical variances, one may test the hypothesis that the two means are 
equal by using "Student's".^ distribution* 

The measurement needed for testing the above hypothesis is computed from 
the relation 




\ 





, (?272) 
30 



-fm - 21.8 

The value of D, the difference between the two means, is 201*7 - 160*5« 
or 41*2. This value of 0 Is to be judged with reference to a hypothetical 
value of zero* Accordingly for T (the discrepancy expressed as^a normal 
deviate) we have 



T 



41.2 



21.8 



1.89 



If the true value of D were zero, a discrepancy as great as this or greater 
would occur as a result of chance more than 5 times out of 100 as a result of 
random fluctuations of sampling* Therefore, even with a standard significance 
of 1 out of 20, one cannot reject the hypothesis' that the two sample 
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distributions of the statistic j could have been drawn from the same parent 
population o 

poueyer p one cannot conclude on the basis of the results of the above 



test that the function Q is not a parameter affecting the incidence of 
j BB° 



depth | if the probability P (P) of a pair of messages/being in depth is 
indeed a function of N (B) and of Oggg P i.e» F (P) *^jN f (B), Ggg^jj , then 



the above test simply provides empirical evidence to Support the stateosnt that 

r ; 



“U°BBB 



N (B) - LB percent saturation 



Xt may very well be that if a similar test had been conducted for the case 
where N (Ej) ^ 3$ percent saturation 



F(P) 

^^BBB 



might have assumed significant proportions*. 
There is fljO logical reason to assume that 



filPl and 3 3- are Hnear c 

• 80 b™ ! 



Therefore pnother test to determine the effect of G^g on the incidenoe of 



depth jfhen M (B) 20 of 30 percent saturation has been conducted* In any 



case p pincp for. M(B) wax, Gggg - 0„ it is apparent that if 0g re is a parameter 
affecting j>he incidence of depthp it is not a very significant parameter when 



N (B) > lt8 percent saturation c 



Fpm patterns IF and V the j-statistio was obtained* A sample size of 
1*0 was- used with a 22*53 percent saturation. The same theory that was used on 
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pattern I and pattern II will be used to obtain the following results in 
the comparison of patterns IV and V 





4 "5 

2 r 

(122.7575) + (142.452) 






40 



- \f 884.690 

- 29 .7t 



The value of D is computed: 

M_ - M. - 279.93 - 206.75 - 73.18 

5 4 

thus: 

T m 73 , ’lft « 2.46 

From this T value one finds when compared with the normal curve or distribution, 
that this value places us at the 2 percent level of confidence of approximation 
of the normal curve. Since this sample of j-statistics so nearly fits the 
normal one may conclude that with the larger sample and the cut to 22.53 
percent saturation that this sample does come from the parent population. 

To provide some idea of just how sensitive a parameter N (B) is, a 
.distribution of j statistics for a sample of size 20 was obtained. This 
distribution, labeled distribution III, is given in Appendix D. Notch pattern 
Bet III (listed in Apendlx C) for which N (B) ■ 22.53 percent saturation, was 
issued to obtain this distribution. It will be noted that the mean for 
distribution III is significahtly greater than that for distributions I and II. 
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A more detailed analysis of the patterns I, JI, IV, V appears in Appendix B« 

« ' ' i - 

CONCLUSIONS : 



Logical deduction supplemented in some instances by statistical in- 
ference strongly- suggests that when N (B) is rslatively large, i.e, N (B) > $0 
percent saturation, the only important parameter affecting the incidence of depth 
is N (B) . However, it is quite probable that G_ no becomes a significant 

ODD 

parameter when N (B) assumes values that are. In the probability sense, npre 
normal, viz, 15 percent - 30 percent saturation. Additional statistical tests 
will have to be conducted before any conclusions can be made regarding this 
last possibility. 

Because the incidence of depth is so dependent upon the value of N (B), it 
is suggested that the AFSAM 9 be utilized in such a manner as will preclude the 
possibility of N (B) ever exceeding, say 25 percent or 30 percent saturation. 

This could be easily accomplished if it were not objectionable to extend' 
further the definition of admissability for notch patterns. For example, we , 
could define an adndssable "set" of patterns as one for which N (B) < 30 
percent saturation for all possible submits comprised of eight patterns. 

Only sete of patterns which are' admissible in the above sense would then be 



^ i i * i „ % - * w __ r ' ’i 1 * \ ^ 

distributed to each usdr, so that It would be impossible for any user to 

I m p 

■'1 , ‘ ' I -■ r • 

select from his set of patterns a subset of eight patterns for which 
N (B) > 30 percent saturation, 

- ^ „ i 

The above redefinition of admissibility would permit greater volume of 
traffic flow and would thus increase the efficiency of the device, provided, 
of course, that the security afforded by the AFSAM 9 isa function of notch 
pattern Identification rather than of recovery. 
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In this study no attempt was made to determine whether the incidence 
of depth is appreciably greater when message starts are restricted by an 
indicator system. The only data relevant to this question that are currently 
available are the statistical pictures in Appendix A. If probability of 
depth is not independent of the proposed indicator system, it would seem 
logical to conjecture that the alignment of the alphabet ring with respect to the 
notch pattern sequence is significantly affecting the incidence of depth. 

To determine whether the alphabet ring alignment is a depth parameter, 
the following test is proposed. For a suitable set of notch patterns two 
distributions of a j - statistic are obtained. One distribution is obtained 
for the case for which the alignment of the alphabet idngs minimizes the 
number of indicator settings that are origins. The other distribution is 
obtained for the case for which the alignment of the alphabet rings minimizes 
the number of indicator settings that are origins. The mean of the two 
distributions could then be compared to determine whether their difference 
is significant. 

Whether or not it will be necessary to undertake any investigations to 
determine the degree of dependence of F (P) on the proposed Indicator system 
has not yet been determined. 

1 "Some Genealogical Characteristics of the Cycle Structure of the 
AFSAH 9 For a Particular Set of Notch Rings” : ( R. P. Murphy and 
Warren Lota, C8A.2 

2 "Notch Rings Suitable for Use in AFSAM 7 and AFSAM 9 N : 

Marguerite D. Newell, C90.021A 
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Robert P. Murphy 
NSA-314 
March 1955 

This paper was in draft fora when Mr* Murphy resigned* Its 
preparation for publication has been the result of a co-operative 
effort of Marvin Bass, NSA-314, Bernard Harris, NSA-314 and 
Lt* William H* Cornelius, Jr., NSA-314. 
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In order to ascertain whether the clustering of message starts in 
various parts of the cycle structure exhibit any "non-random" characteristics 
when quasi-random or indicator starts are employed, a sample of 1,000 random, 
1,000 quasi-random, and 1,000 indicator starts were obtained* These were 
grouped according to points at which their successors entered a cycle* The 
frequencies of each group are presented in Table 1* 

To test the hypothesis that these three samples may be regarded ae three 
samples from the same population, we construct the likelihood ratio test for 
homogeneity of samples from a multinomial distribution* 

Let be the number of elements in the ith cell in the jth sample 

(i e l,2, . * *,m/ j “ l,2,***,k) 

Forming the likelihood ratio, we get: 



Let 






nj and J 
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- 2 log 




k-tym-l) 



2 J E Mj^log 



|( ] 



n ‘j 



J5u 



- leg E 

i " 



') 



Vo know that this quantity is approximately distributed as with 

\ 

(k-l)(m-l) degrees of freedom. (Logarithms are base a) , 

Applying the above testp we find o 17.62, and we therefore accept 

the null hypothesis at the • 0 5 level of significances '* 

In Table Up the average lengths of lead-ins to various cycle entry points 
are shown. Due to the non-orthogonality of the data, a regression analysis 
was not carried out at this time. Since the table suggests that random 'etarta 

i 

i 

may give rise to longer lead-ins, some analysis of thie type will be oarried 
out at some future date. The data does not suggest that results from the 
quasi-random starts would lead to less depth than the random etarta* 



am 



ENTRY 


RANDOM 




INDICATOR 


AA 


142 


147 


126 


AB 


194 


179 


173 


AO 


60 


64 


56 


BA 


133 


143 


131 


BB 


66 


72 


83 


BC 


30 


24 


24 


ED 


42 


30 


45 


BE 


71 


72 


85 


Residue 


260 


269 


273 



■ SECHM 1 



8888852 



• kJA^V>aXl.XJ X 



TABU II 



ENTRY 



RANDOM QUASI-RANDOM 



AA 

AB 

AC 



311110 

384011 

183975 

289334 

316828 

2004Q3 

240007 

323878 



295353 

386665 

198623 

282557 

309518 

168741 

206810 

321169 
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INDIflATOR 

272378 

373349 

177730 

283168 

300674 

185079 

228950 

346601 
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1 APPENDIX B 

i '*' 

/ 

PERCENT J PROBABILITY OF SELECTING PERCENT 

SATURATION A SET OF PATTERNS HAVINO SATURATION 

A HIGHER PERCENT SATURATION 


PROBABILITY OF SELECTING 
A SET OF PATTERNS HAVING A 
-aXGHKR faRCENT SATURATION 


00 


1.000 




26 


0.221 


01 


1.000 




27 


0.177 


02 


1.000 




28 


0.141 


03 


1.000 




29 


0.109 


04 


1.000 




3Q 


0.085 


05 


1.000 




31 


0.065 


06 


1.000 




32 


0.048 


07 


1.000 




33 


0.035 


08 


1.000 




34 


0.025 


09 


0.999 




35 


0.019 


10 


0.998 




36 


0.014 


11 


0.994 




37 


0.010 


12 


0.986 




36 


0.007 


13 


0.974 




39 


0.005 


14 


0.952 




40 


0.004 


15 


0.923 




41 


0.003 


16 


0.879 




42 


0.002 


17 


0.828 




43 


0.002 


18 


0.766 




44 


0.001 


19 


0.697 




45 


0.001 


20 


0.621 




46 


0.001 


21 


0.546 




47 


0.001 


22 


0.476 




48 


0.001 


23 


0.399 




49 


0.001 


24 


0.331 




50 


0.000 


25 


0.281 




51-99 


0.000 
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PATTERN I 


PATTERN II 


PATTERN- Ill 




1 


2 3 U 5 6 7 8 


12315678 


12345678 


0 


0 


1111111 


01111111 


11100000 


1 


1 


10 11111 


11100000 


01111001 


2 


0 


1 1 0 0 0 0 0 


01111111 


11101111 


3 


1 


0 0 0 0 0 0 0 


10100000 


00010111 


k 


1 


1111111 


11111111 


11010110 


5 


1 


10 11111 


11000000 


01010110 


6 


1 


1 1 0 0 0 0 0 


11011111 


11011001 


7 


0 


1 0 0 0 0 0 0 


01000000 


01101100 


8 


1 


0 111111 


lOOlllfl 


10111111 


9 


0 


0 0 11111 


00001100 


00101011 


10 


0 


1 1 0 0 0 0 0 


0 1 1 1 1 ri 1 


11001010 


11 


1 


1 0 0 0 0 0 0 


11101100 


11010010 


12 


0 


0 111111 


00111111 


10001101 


13 


0 


10 11111 


01101100 


01101000 


Ik 


1 


0 1 0 0 0 0 0 


10111111 


00111000 


15 


1 


10 10 111 


11001100 


01110111 


16 


0 


1111111 


01011111 


11101011 


17 


0 


0 0 0 1 0 0 0 


00001100 


00100001 


IS 


1 


1110 111 


11011111 


11010010 


19 


1 


0 0 11111 


10001100 


10111101 


20 


1 


110 10 0 0 


11111111 


01111000 


21 


0 


10 10 111 


01101100 


01101111 


22 


0 


0 111111 


00111111 


10100100 


23 


0 


1 0 0 1 0 0 0 


01101100 


11001111 


2 k 


1 


0 110 111 


10111111 


10111100 


25 


01011111 


01010000 


01100011 


26 


01101000 


01010011 


11100111 


27 


11010111 


11010000 


01011111 


28 


11111111 


11110011 


11111101 


29 


10101000 


10110011 


10111111 


30 


00110111 


00110011 


10100100 


31 


10101000 


10110011 


00010011 


32 


10111111 


10110011 


10010110 


33 


11100000 


11000011 


11010111 


3 k 


10111111 


10100011 


00011111 


35 


10000000 


10000000 


10011000 
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PATTERN IV 


PATTERN V 




1 2 3 4 5 678 


\ * 3 4 5 6 7 6 


0 


0 i i'll 111 


01111111 


1 


11100000 


11011111 


2 


01111111 


01101100 


5 


10100000 


10000000 


4 


111X2111 


11110011 


5 


11000000 


11011111 


6 


11111111 


1 t 1 0 1 1 OQ 


? 


01000000 


01001100 


8 


10011111 


10110011 


9 


00000000 


00010011 


10 


01111111 


01101100 


11 


11100000 


11001100 


12 


00111111 


00111111 


33 


01100000 


01010011 


14 


10111111 


10100000 


15 


11001100 


11001101 


16 


01011111 


01111111 


17 


00011111 


00011110 


18 


11011111 


11100001 


19 


10011110 


10000001 


20 


11111111 


11111110 


21 


01111110 


01011111 


22 


00111111 


00101101 


23 


01111111 


01000000 


24 


10111111 


10110011 


25 


01011111 


01011111 


26 


01011111 


01111110 


27 


11011111 


11001101 


28 


11110011 


11110011 


29 


10100001 


10110010 


30 


00100001 


00111111 


31 


10100001 


10101100 


32 


10100000 


10110011 


33 


11000000 


11110010 


34 


10100000 


10111111 


35 


10000000 


10000000 



(Both patterns hare the same saturation as pattern 3, vis 22=53 percent) 
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(.1 - statistics for messages of length 5000 ) 





DISTRIBUTION I 


DISTRIBUTION II 


DISTRIBUTION III 


1 


236 


230 


197 


2 


203 


199 


381 


3 


189 


156 


355 


6 


305 


283 


187 


5 


69 


118 


316 


6 


150 


237 


209 


7 


283 


75 


263 


8 


220 


216 


355 


9 


138 


166 


615 


10 


182 


166 


350 


11 


39 


151 


216 


12 


108 


105 


126 


13 


162 


132 


290 


16 


365 


20 


207 


15 


157 


192 


606 


16 


166 


159 


359 


17 


275 


85 


261 


16 


231 


126 


27 


19 


209 


379 


383 


20 


229 


106 


295 


21 


299 


187 


5592 


• 22 


228 


68 




23 


99 


162 




26 


156 


211 




25 


176 


176 




26 


266 


366 




27 


136 


122 




28 


118 


61 


- 


29 


636 


58 




30 


268 

?05<r 


-173.. . 
6815 






j - 201.67 


J - 160. 50 


3 - 279.6 




0 - 85.28 


a - 83.37 


a - 101.8 
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DISTRIBUTION IV 


DISTRIBUTION V 


1 


113 


303 


2 


310 


138 


3 


90 


390 


4 


341 


59 


5 


173 


174 


6 


60 


99 


7 


252 


421 


8 


536 


355 


9 


154 


150 


10 


56 


273 


11 


273 


116 


12 


83 


40 


13 


106 


364 


n 


294 


208 


15 


291 


250 


16 


109 


296 


17 


161 


606 


18 


122 


286 


19 


157 


83 


20 


221 


304 


21 


200 


559 


22 


71 


199 


23 


361 


384 


24 


277 


304 


25 


212 


459 


26 


207 


29 


27 


113 


48 


28 


208 


378 


29 


485 


. 384 


30 


140 


98 


31 


w 


262 


32 


82 


278 


33 


53 


260 


3 k 


358 


336 


35 


I83 


348 


36 


23 


439 


37 


461 


256 


38 


166 


449 


39 


380 


470 


40 


151 


342 


1 ■ 


206.75 


j - 279.925 


q o> 


122.7575 


9 - 142.542 
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APPENDIX E 

SUMMARY AND RESULTS 

I. The four sets of observations J-q, Jl2» »^21* ^22 were plotted on 
probability paper. No severe deviations from normality were noted, 
corresponds to pattern I, to pattern II, J.y to pattern IV, and Jgg 
to pattern V. 

II Sample means and sample variances were computed. The fact that 
the four sets of observations are heteroscedastic was noted and verified hr 
Bartlett's Test establishing significance at the 1 percent level. 

III An analysis of variance wae carried out by inverse weighting of 
variances per Theorem of David and Neyman ("Extension of the Markoff Theorem 
on Least Squares, " Statistical Research Memoirs., 2: 105-116 (1936)). 

Let % [Mil V#12 -J/21 U/22 

In accordance with the linear hypothesis write ; 

Y ijk i + + £ijk 

and 

Mu M'fix * >V 

Hq is then^^ ■ ■ 0. 

A is the effect of saturation level, ^ is the effect of the branching pattern 
within saturation level. 

£ ijk Is the residual error; furthermore, is N (0,o^j) # p or con- 

venience let 2/3^ “0, S * 0. Let o±^m w ^ ere ar * w ®tght9 used 

to correct for hetsroscedastxcxuy. Designate sample estimates of 

M>($ i. Vj, » 2 1J w K. b. 0. S. E bi - E C A - 0 
In the Neyman-David Theorem, the are known constants. In the problem, 
they were derived from the data, a procedure which will tend to weaken the 
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significance level of the test somewhat* In view of the large number of degrees 

2 

of freedom for estimation of each it is felt that this will not be too serious, 
A) To determine’ 'the effect _o£,,// and reduction of the total sum of Squares 
under the null hypothesis. 

Compute the least squares estimate of#* 



Assume: Y ijk -M+ £ ljk 
then M 



SSE ■ E Wj , 
ijk XJ 



( T ijk “ 



SSH - TSS - SSE 



TSS 



■ E W, 
ijk 







T?lr ' -\* k "u < T ijk - « - ° 



ijk * "ifj a *i w u 

is the number of observations in group 



H 



E 

i.ik 



M iJ Y ijk; 



ij " 1J 
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SSE * £ (ijjk — M) 
ijk 



'ijk WlJ IlJk * 2 "tjk WlJ Illk * * j b ij *»ij 



■ * »U X ljk - "PHu I ljk - M £ 

ijk Li Jk J iJ 




Since N 



13k W l,1 T ljk 
K iJ w U 



SEE ® £ Wj j 
ijk J/ 






ijk * 1Jk 

.vw 




£ W. 



ijk 



ij Hjk 



- M "u w ij 



2 

SSH T H 
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This gives the partition of the sum of equ&ree under the $jul3L 
hypothesis o Any effect due and must provide a further re- 

duction of SS5» Since the date is non-orthogonal we can not estimate 
pa rame ters separately but must repeat, the entire prooese of Obtain, siaul - 



taneous estimates! 



ut "i - Tj - °* 

then /< u - « ^ ■ K x > ^ - Ajj ■ /Cj 

"* T ijk ‘A* fii. * ^ijk 
*'♦'>! -/<?! 



«*’ 'ijk ^ * V 

^ S3B ' „.jt a (1. 41. - H' - bj) 

an ■ JJ,, "id '*ljk 1' 



bf - - 2 IWjj <*ljk* 141 - b i> " 0 



♦ *. &.« W n b i 



ifd.k IlJk ’ 13 & " 13 13 . 



* ^ T idk * * * “u “id^j “id “id 



SECBET 
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Since b^ ■ - bj^ by virtue of 5 b^ - 0 

4 w i3 hi k • "' fj hi hi * b i ] <»U w u - "y hi) 

4 w ij T yk " M ' * B ij w u * J "n w ij 

Solution of this set of equations provides the estimates 

for $v $ 2, Kr 



**' -4 hi < T ijk - M ’ - b i> 



Since M ♦ b^ “ Mi. 



SSE* - E V. . (r. - A ) 

ijk 1 J k 1 



- 2 A J\ ft 

"ijk W±J TiJk " 2 ^Jk ^ 1 * 
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SSE ’ " E W iJ T ijk “ 2 4 W 1J T ljk A x \* j N iJ hi fi ± 



SSE' - TSS - 2 



A 2 
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SSE - SSE* is the reduction in sum of squares due to the introduction 
of b^. 



F • 



Reduction 



where N_ is the number of degrees of freedom due to 



SSE'/Nq 

error may be used to test the hypothesis ^ ■ 0 
■ Now consider Bgrfijk "/(♦ * If j + £ ijk 

where f£ ^ not all sero and If ^ not all aero. 



As beforei compute the least squares estimates and the partition of 
the sum of squares* 



MS* * * »ij «ljk - * ' b i’- C / 
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i, 
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* hi r ijk - «* 5 «u «ij * b ’i * «u N ij ♦ * *1) »ij °j 

J » K J J f j 

£ w ij V ■ M " l w u "u * l h s "ij *"i * °j i h 3 »u 



Since b£ ■ “ 
and C 2 * “ C 1 



We get: 



1>3 f k h s V ■ , hi * b ’i * ■« - w =j hi) 



* °1 J (l<li - N i 2 *12) 



* k N U V - *" * hi hi 4 b 'i * hi hi 4 c i (M n w n - *12 V 



hi h 3 * ■ *■ l »u "ij * b 'x ("u hi - hi * 2 j) * c j l hi hi 

The solution of these equations nay be obtained using conventional 
matrix methods (e.g. Gauss - Doolittle, Crout, etc.). 

The reduction in sum of squares due to' error is obtained as follows: 
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SSE" - E W tJ ttijic - M" - b« ± - 0^) 
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Designate M M 4 + Cj by 
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A 2 

SSE« - L Wj x (^ijk " *\j) 



ijk 






A yy 2 

SSE" - TSS - 2E fc W ±j + ^ \l W iJ 

SSE* - SSE"i9 the additional reduction in sums of squares due to 
error obtained by -introducing Cj* 
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ANALYSIS OF VARIANCE 



Due to 


d.f. 


S.S. 


Total Sum of Squares 


UO 


6,760, 51t. 731 


M 


1 


. 152 >g02 


SSE 


139 


1.551,362.229 


Reduction due to b. 


1 


96.A91.8AT* 1 ' 
1,A5A, 870.382 


SSE' 1 


138 


Reduction due to C. 
SSE" J 


_JL 


87.' J52.187* 


137 


1,367,518.195 



* Bignif leapt at .01 level. 

The hypothesis ^ 12 " ^Czi " ^(22 i8 then ejected at the 

•OX level and the N 'F-test n indicates a significant effect due to both 
saturation level and branching pattern for these four sets of observations* 
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J ll 


1 Y ljk 
2 


4,815 


i r ijk 


974,353 


w u 


1.439 


"13 


30 


■H 

35 

■H 


43.170 



E 6,928.785 

2 

SW iJ T iJk 1^02,093.967 

M 

U' 



SUMMARY OF COMPUTATIONS 



J 12 


J 21 


6,050 


8,270 


1,430,970 


2,312,602 


1.375 


.647 


30 


40 


41.250 


25 .880 


8,318.750 


5,350.690 


967,583.750 


1,496,253.494 



200,562 

209.265 



J 22 


Tbtal 


11,197 


50,332 


3,947,049 


8,664,974 


.480 




40 


140 


19.200 


129.500 


5,374.560 


25,972.785 


1,894,583.520 


6,760,514.731 



b i 

A 



n 

M 



»’i 

c j 

A 

lj W 1J "ij 



- 28.650 


28.650 


180. 615 


237.915 




211.494 


-30.286 


30.286 


-26.076 


26.076 


155.132 207.284 


215.704 267.856 


6,697.04844 8550.465 


5582,41952 5142*3352 



SECRE T 



s 



SSE 1,551,362.229 
SSM 5,209,152*502 
SSE' 1,454,870*382 
SEE" 1,367,518.19$ 



2 







BARTLETT'S TEST 



S^/N^ log S^/Nj Nj log 3^/R 



201, 5*5*50 N x 

210.866.50 N 2 

602.779.50 N 3 
812,672.79 

1,827,86**29 N 



29 69*9.84 

29 7271.26 

39 15455.88 

39 20837.76 

- 136 13*40.18 



8.8*806 256*59374 
8.89467 257.94543 
9.64750 376.25250 
9.94633 387.90687 
9.50771 1293*04856 



» i»g f - 1 * , i io s -5^ 

- 1293.0*856 - 1278.69854 



- 14.35002 



2 „ 2 
S i “ E X i “ X 1 

c-i 

* «• ( .121786 - .007353) 

- 1 ♦ .01907 - 1.01907 
M - 14.0815 

H la significant at .01 level. 
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